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Maximal Orders in the Design of Dense Space- Time Lattice Codes 

Camilla Hollanti, Jyrki Lahtonen, Member IEEE, and Hsiao-feng (Francis) Lu 

Abstract 

We construct explicit rate-one, full-diversity, geometrically dense matrix lattices with large, non-vanishing 
determinants (NVD) for four transmit antenna multiple-input single-output (MISO) space-time (ST) applications. 
The constructions are based on the theory of rings of algebraic integers and related subrings of the Hamiltonian 
quaternions and can be extended to a larger number of Tx antennas. The usage of ideals guarantees a non-vanishing 
determinant larger than one and an easy way to present the exact proofs for the minimum determinants. The idea 
of finding denser sublattices within a given division algebra is then generalized to a multiple-input multiple-output 
(MIMO) case with an arbitrary number of Tx antennas by using the theory of cyclic division algebras (CDA) and 
maximal orders. It is also shown that the explicit constructions in this paper all have a simple decoding method based 
on sphere decoding. Related to the decoding complexity, the notion of sensitivity is introduced, and experimental 
5_j ■ evidence indicating a connection between sensitivity, decoding complexity and performance is provided. Simulations 

in a quasi-static Rayleigh fading channel show that our dense quaternionic constructions outperform both the earlier 
rectangular lattices and the rotated ABBA lattice as well as the DAST lattice. We also show that our quaternionic 
lattice is better than the DAST lattice in terms of the diversity-multiplexing gain tradeoff. 
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Index Terms 

Cyclic division algebras, dense lattices, maximal orders, multiple-input multiple-output (MIMO) channels, 
multiple-input single-output (MISO) channels, number fields, quaternions, space-time block codes (STBCs), sphere 
decoding. 

I. Introduction and background 

Multiple-antenna wireless communication promises very high data rates, in particular when we have perfect 
\q ', channel state information (CSI) available at the receiver. In [1] the design criteria for such systems were developed 
CsJ 1 and further on the evolution of ST codes took two directions: trellis codes and block codes. Our work concentrates 
^ . on the latter branch. 

The very first ST block code for two transmit antennas was the Alamouti code [2] representing multiplication in 
q " the ring of quaternions. As the quaternions form a division algebra, such matrices must be invertible, i.e. the resulting 
. STBC meets the rank criterion. Matrix representations of other division algebras have been proposed as STBCs at 
. £h ! least in [3]-[15], and (though without explicitly saying so) [16]. The most recent work [6]-[16] has concentrated 
^ ' on adding multiplexing gain, i.e. multiple input-multiple output (MIMO) applications, and/or combining it with a 
^ . good minimum determinant. In this work, we do not specifically seek any multiplexing gains, but want to improve 
upon e.g. the diagonal algebraic space time (DAST) lattices introduced in [5] by using non-commutative division 
algebras. Other efforts to improve the DAST lattices and ideas alike can be found in [17]-[19]. 
The main contributions of this work are: 

• We give energy efficient MISO lattice codes with simple decoding that win over e.g. the rotated ABBA [20] 
and the DAST lattice codes in terms of the block error rate (BLER) performance. 

• It is shown that by using a non-rectangular lattice one can gain major energy savings without significant 
increasement in decoding complexity. The usage of ideals moreover guarantees a non-vanishing determinant 
> 1 and an easy way to present the exact proofs for the minimum determinants. 

• In addition to the explicit MISO constructions, we present a general method for finding dense sublattices within 
a given CDA in a MIMO setting. This is tempting as it has been shown in [15] that CDA-based square ST 
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codes with NVD achieve the diversity-multiplexing gain tradeoff (DMT) introduced in [21]. When a CDA is 
chosen the next step is to choose a corresponding lattice or, what amounts to the same thing, choose an order 
within the algebra. Most authors, among which e.g. [11], [15], and [16], have gone with the so-called natural 
order (see Section IIII-B1 Example I3.2I ). In a CDA based construction, the density of a sublattice is lumped 
together with the concept of maximality of an order. The idea is that one can, on some occasions, use several 
cosets of the natural order without sacrificing anything in terms of the minimum determinant. So the study of 
maximal orders is easily motivated by an analogy from the theory of error correcting codes: why one would 
use a particular code of a given minimum distance and length, if a larger code with the same parameters is 
available. 

• Furthermore, related to the decoding complexity, the notion of sensitivity is introduced for the first time, and 
evidence of its practical appearance is provided. Also the DMT behavior of our codes will be given. 

At first, we are interested in the coherent MISO case with perfect CSI available at the receiver. The received 
signal y G C n has the form 

y = hX + n, 

where X G c mxn is the transmitted codeword drawn from a ST code C, h G C m is the Rayleigh fading channel 
response and the components of the noise vector n G C n are i.i.d. complex Gaussian random variables. 

A lattice is a discrete finitely generated free abelian subgroup of a real or complex finite dimensional vector 
space V, also called the ambient space. Thus, if L is a fc-dimensional lattice, there exists a finite set of vectors 
B = {t>i, b2, . . . , bfc} C V such that B is linearly independent over the integers and that 

k 

L = z i^i \ Zi £ Z, hi £ V for all % = 1, 2, . . . , k}. 

i=l 

In the space-time setting a natural ambient space is the space C nxn of complex n x n matrices. When a code is 
a subset of a lattice L in this ambient space, the rank criterion [22] states that any non-zero matrix in L must be 
invertible. This follows from the fact that the difference of any two matrices from L is again in L. 

The receiver and the decoder, however, (recall that we work in the MISO setting) observe vector lattices instead 
of matrix lattices. When the channel state is h, the receiver expects to see the lattice hX. If h / and L meets 
the rank criterion, then hX is, indeed, a free abelian group of the same rank as L. However, it is well possible 
that hL is not a lattice, as its generators may be linearly dependent over the reals — the lattice is said to collapse, 
whenever this happens. 

From the pairwise error probability (PEP) point of view [22], the performance of a space-time code is dependent 
on two parameters: diversity gain and coding gain. Diversity gain is the minimum of the rank of the difference 
matrix X — X' taken over all distinct code matrices X, X' G C, also called the rank of the code C. When C is 
full-rank, the coding gain is proportional to the determinant of the matrix (X — X')(X — X') H , where X H denotes 
the transpose conjugate of the matrix X. The minimum of this determinant taken over all distinct code matrices 
is called the minimum determinant of the code C and denoted by 5c- If $c is bounded away from zero even in the 
limit as SNR — > oo, the ST code is said to have the non-vanishing determinant property [8]. As mentioned above, 
for non-zero square matrices being full-rank coincides with being invertible. 

The data rate R in symbols per channel use is given by 

R=hog {S{ (\C\), 

n 1 1 

where \S\ and \C\ are the sizes of the symbol set and code respectively. This is not to be confused with the rate 
of a code design (shortly, code rate) defined as the ratio of the number of transmitted information symbols to 
the decoding delay (equivalently, block length) of these symbols at the receiver for any given number of transmit 
antennas using any complex signal constellations. If this ratio is equal to the delay, the code is said to have full 
rate. 

The correspondence is organized as follows: basic definitions of algebraic number theory and explicit MISO 
lattice constructions are provided in Section [TIJ As a (MIMO) generalization for the idea of finding denser lattices 
within a given division algebra, the theory of cyclic algebras and maximal orders is briefly introduced in Section 
Hill In Section [TV] we consider the decoding of the nested sequence of quaternionic lattices from Section [TT] A 
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variety of results on decoding complexity is established in Section [TV] where also the notion of sensitivity is taken 
into account. Simulation results are discussed in Section [V] along with energy considerations. Finally in Section 
IVTl the DMT analysis of the proposed codes will be given. 

This work has been partly published in a conference, see [3] and [4]. For more background we refer to [22]-[29]. 

II. Rings of algebraic numbers, quaternions and lattice constructions 

We shall denote the sets of integers, rationals, reals, and complex numbers by Z, Q, R, and C respectively. 
Let us recall the set 

EI = {a± + a2i + «3j + a^k I at £ R Vi}, 

where i 2 = j 2 = k = — 1, ij = k, as the ring of Hamiltonian quaternions. Note that EI ~ C © Cj, when the 
imaginary unit is identified with i. A special interest lies on the subsets 

HIc = {«l + 0-21 + asj + a^k | a* 6 Z Vt} C EI and 

= { a iP + a 2i + a%j + a 4 /c \ at & Z Vi, p = -(1 + i + j + A;)} C EI 

called the Lipschitz' and Hurwitz' integral quaternions respectively. 
We shall use extension rings of the Gaussian integers 

Q = { a + U I a, b G Z} 

inside a given division algebra. It would be easy to adapt the construction to use the slightly denser hexagonal ring 
of the Eisensteinian integers 

£ = {a + buj j a, b 6 Z}, 

where u 3 = 1, as a basic alphabet. However, the Gaussian integers nicely fit with the popular 16-QAM and QPSK 
alphabets. Natural examples of such rings are the rings of algebraic integers inside an extension field of the quotient 
fields of Q, as well as their counterparts inside the quaternions. To that end we need division algebras A that are 
also 4-dimensional vectors spaces over the field Q(i). 



A. Base lattice constructions 

Let now £ = e 7 ™/ 8 (resp. £ = e 7 ™/ 4 = (1 + i)/\f% be a primitive lQ th (resp. 8 th ) root of unity. Our main 
examples of suitable division algebras are the number field 

L = Q(C), 

and the following subskewfield 

H = Q(0 © jQ(0 Q EI 

of the Hamiltonian quaternions. Note that as zj = jz* for all complex numbers z, and as the field Q(£) is stable 
under the usual complex conjugation (*), the set H is, indeed, a subskewfield of the quaternions. 

As always, multiplication (from the left) by a non-zero element of a division algebra A is an invertible Q(i)-linear 
mapping (with Q(i) acting from the right). Therefore its matrix with respect to a chosen Q(i)-basis B of A is also 
invertible. Our example division algebras L and H have the sets Bl = {LC>C 2 >C 3 } an d Bh = {l)£>j, j£} as 
natural Q(i)-bases. Thus we immediately arrive at the following matrix representations of our division algebras. 

Proposition 2.1: Let the variables ci, 02,03,04 range over all the elements of Q(i). The division algebras L and 
H can be identified via an isomorphism with the following rings of matrices 

ic 4 ic 3 ic 2 \ 1 
ci ici ic3 I 

C2 C\ iC4 I 

C3 c 2 Cl / J 



L = < 



M L = M L (ci,C2,c 3 ,c 4 ) 
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\ c 4 



4 
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M = M( Cl ,c 2 ,c 3 ,c 4 ) 



( 


Cl 


ic 2 
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Cl 
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C4 
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" * 

-zc 2 
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The isomorphism (j) from L into the matrix ring is determined by Q(i)-linearity and the fact that ( corresponds 
to the choice c 2 = 1, c\ = 03 = C4 = 0. The isomorphism <p from H into the matrix ring is determined by 
(Q)(i)-linearity and the facts that £ corresponds to the choice C2 = 1, c\ = 03 = C4 = 0, and j corresponds to the 
choice C3 = 1, ci = C2 = C4 = 0. In particular, the determinants of these matrices are non-zero whenever at least 
one of the coefficients ci, 02,03,04 is non-zero. ■ 

In order to get ST lattices and useful bounds for the minimum determinant, we need to identify suitable subrings 
S of these two algebras. Actually, we would like these rings to be free right (/-modules of rank 4. This is due 
to the fact that then the determinants of the matrices of Proposition 12.11 that belong to the subring (f)(S) must be 
elements of the ring Q. We repeat the well-known reason for this for the sake of completeness: the determinant of 
the matrix representing the multiplication by a fixed element x G S does not depend on the choice of the basis B 
and thus we may assume that it is a (/-module basis. However, in that case xB C S, so the matrix will have entries 
in Q as all the elements of S are (/-linear combinations of B. The claim follows. 

In the case of the field L we are only interested in its ring of integers Ol = Z[£] that is a free (/-module with the 
basis Bl- In this case the ring <P(Ol) consists of those matrices of L that have all the coefficients ci, 02, 03,04 G Q. 
Similarly, the (/-module 

c = g © iG © jg © ag 

spanned by our earlier basis Bh is a ring of the required type. We call this the ring of Lipschitz' integers of H. 
Again </>(£) consists of those matrices of H that have all the coefficients 01,02,03,04 G (/. While Ol is known to 
be maximal among the rings satisfying our requirements, the same is not true about C. The ring My also has an 
extension of the prescribed type inside H, called the ring of Hurwitz' integers o/H. This ring, denoted by 

is the right (/-module generated by the basis Bhw = {p, p£, j, where again p = (1 + i + j + k)/2. The fact 
that TL is a subring can easily be verified by straightforward computations, e.g. £p = p£ — j£. For future use we 
express the ring TL in terms of the basis Bh of Proposition 12.11 It is not difficult to see that the element 

Q = ci + £c 2 + jc 3 + j£c 4 e H 

is an element of TL, if and only if the coefficients q satisfy the requirements (1 + i)ct G (/ for all t = 1,2,3,4 
and ci + 03, 02 + 04 G (/. As the ideal generated by 1 + i has index two in (/, we see that C is an additive, index 
four subgroup in TL. We summarize these findings in Proposition 12.21 The bound on the minimum determinant is 
a consequence of the fact that all the elements of (/ have a norm at least one. 

Proposition 2.2: The following rings of matrices form ST lattices with minimum determinant equal to one. 

Li = {M L (01,02,03,04) I 01,02,03,04 G (/} , 
L 2 = {M(ci, c 2 , 03,04) I 01,02,03,04 G (/} , 

L 3 = j M (01,02,03,04) I 01,02,03,04 G — ^ G, ci +c 3 G (/,c 2 +c 4 G (/I . 



Remark 2.1: The lattice Li is quite similar to the DAST lattice in the sense that all of its matrices can be 
simultaneously diagonalized. See more details in Section |IV-B| The lattice L 2 , for its part, is a more developed 
case from the so-called quasi-orthogonal STBC suggested e.g. in [30]. The matrix M(ci, c 2 , 03, 04) of Proposition 
12. 11 can also be found as an example in the landmark paper [6], but no optimization has been done there by using, 
for example, ideals as we shall do here. 
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A drawback shared by the lattices L\ and L 2 is that in the ambient space of the transmitter they are isometric 
to the rectangular lattice Z 8 . The rectangular shape does carry the advantage that the sets of information carrying 
coefficients of the basis matrices are simple and all identical which is useful in e.g. sphere decoding. But, on the 
other hand, this shape is very wasteful in terms of transmission power. Geometrically denser sublattices of Z 8 , e.g. 
the checkerboard lattice 



(xi, 



Xi = (mod 2) 



i=l 



and the diamond lattice 



E$ = < (xx, ...,x 8 ) e 



X j X n 



8 ) 
j (mod 2), ^x, = (mod 4) \ 

i=i ) 



are well-known (cf. e.g. [31]). However, we must be careful in picking the copies of the sublattices, as it is the 
minimum determinant we want to keep an eye on (see Remark I2.3I ). 



B. Dense sublattices inside the base lattice L2 

As our earlier simulations [3], [4] have shown that L 2 outperforms L\, we concentrate on finding good sublattices 
of L 2 . The units of the ring L2 are exactly the non-zero matrices whose determinants have the minimal absolute 
value of one. Thus a natural way to find a sublattice with a better minimum determinant is to take the lattice 4>(X), 
where X C 5 is a proper ideal. This idea has appeared at least in [3], [4], and [8]. Even earlier, ideals of rings of 
algebraic integers were used in [27] to produce dense lattices. Let us first record the following simple fact. 

Lemma 2.3: Let A and B be diagonalizable complex square matrices of the same size. Assume that they commute 
and that their eigenvalues are all real and non-negative. Then 

det (A + B) > det A + detB 

with a strict inequality if both A and B are invertible. 

Proof: As A and B commute, they can be simultaneously diagonalized. Hence, we can reduce the claim to 
the case of diagonal matrices with non-negative real entries. In that case the claim is obvious. ■ 

In Proposition 12.41 we give a construction isometric to the checkerboard lattice Dg 
Proposition 2.4: Let X be the prime ideal of the ring Q generated by 1 + i. Define 

%c = {(ci + £,c 2 ) + j(c 3 + £c 4 ) G C I ci + c 2 + c 3 + c 4 G J}. 

Then Tc is an ideal of index two in C. The corresponding lattice 

L 4 = {M(ci, C2, C3, C4) G L2 I Cl + C2 + C3 + C4 G 2} 

is an index 2 sublattice in Furthermore, the absolute value of det(MM ), M G L4 \ {0}, is then at least 4. 

Proof: It is straightforward to check that Xc is stable under (left or right) multiplication with the quaternions 
£ and j, so Xc is an ideal in C. 

Let us consider a matrix M G L4 and write it in the block form 

(A -B H 
M ={b A" 

We see that 

tII ( AA H + BB H 

1V1 1VJ 

and 



MM AA H + BB H 



AA H + BB H = f ? k * 

k a 
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where a = Ylj=i \ c j\ * s a non-negative integer and k = —ic\c\ + c 2 c\ — ic%c\ + C4C3 is a Gaussian integer with 
the property k* = ik. We are to prove that det MM H = [a 2 — |&| 2 ) 2 > 4. Assume first that 03 = 04 = 0, i.e. the 
block B = 0. Then det (A) is the relative norm 



det(A)=i^( Cl +£c 2 ), 

which is a Gaussian integer. As c\ + £c 2 is a non-zero element of the ideal X, we conclude that det (A) is a non-zero 
non-unit. Therefore det(j4) det(j4^) > 2, and the claim follows. 

Let us then assume that both A and B are non-zero. Then det (A) and det (B) are non-zero Gaussian integers 
and have a norm at least one. The matrices A, A H , B, B H all commute, so by Lemma |2~31 we get 

det(MM H ) > det(AA H ) 2 + det(BB H ) 2 > 2. 

As det(MM^) = (a 2 - \k\ 2 ) is a square of a rational integer, it must be at least 4. ■ 

Remark 2.2: It is easy to see that in the previous proposition a + bi G X, if and only if a + b is an even integer. 
Thus geometrically the matrix lattice L4 is, indeed, isometric to Dg. 

We proceed to describe two more interesting sublattices of L 2 with even better minimum determinants. To that 
end we use the ring TL (or the lattice L3). The first sublattice is isometric to the direct sum D4 _L D4 [31] of two 
4-dimensional checkerboard lattices. 

Proposition 2.5: Let again X be the ideal (1 + i)Q. The lattice 

L 5 = {M(ci,c 2 ,c 3 ,c 4 ) £ L 2 \ci + c 3 ,c 2 + c 4 G X} 

has a minimum determinant equal to 16. The index of L4 in L 2 is 4. 

Proof: The coefficients c\ and C3 can be chosen arbitrarily within Q. The the ideal X has index 2 in Q, and 
the coefficients c 2 and C4 now must belong to the cosets c\ + X and C3 + X respectively. Whence, the index of L5 
in L 2 is 4. The matrices A in the lattice L5 are of the form A = (1 + i)M, where M is a matrix in the lattice L3 
of Proposition 12.21 Thus det{AA H ) = 16det(MM^) and the claim follows from Proposition 12.21 ■ 

The diamond lattice E% can be described in terms of the Gaussian integers as (cf. [32]) 

£ 8 = -1— <j(ci,c 2 ,C3,c 4 ) G C? 4 I ci +l = c t +X, t = 2,3,4, Vc f G 20 1 . 

By our identification of quadruples (ci, c 2 , c 3 , C4) G ^ 4 and the elements of H it is straightforward to verify that 
(1 + i)E 8 has {2, (1 + i) + (1 + (1 + + (1 + 1 + £ + j + j£} Q £ as a 0-basis, whence the set 
{1 + i, 1 + ^, £ + j, p + p£} C is a (/-basis for Eg. By another simple computation we see that Eg = Tt(l + £), 
i.e. Eg is the left ideal of the ring TL generated by 1 + £. 

Proposition 2.6: The lattice 

L 6 = |m(ci,c 2 ,C3,c 4 ) G L 2 | ci +X = c t +X, t = 2,3,4, G 2Q | 

is an index 16 sublattice of L 2 . Furthermore, the minimum determinant of L6 is 64. 

Proof: Let M/ = M(l, 1, 0, 0) be the matrix <p(l + £) under the isomorphism of Proposition 12.11 We see that 
det(MiMj ) = 4. By the preceding discussion any matrix A of the lattice Lq has the form A = MMj(l+i), where 
M is a matrix in L 3 . As in the proof of Proposition [231 we see that det AA H = 16det(M/M/ f ) det(MM H ). The 
claim on the minimum determinant now follows from Proposition 12.21 We see that the coefficient c\ can be chosen 
arbitrarily within Q. The coefficients c 2 and C3 then must belong to the coset c\ +X, and C4 must be chosen such 
that ci + c 2 + C3 + C4 G 2Q = X 2 . As X has index two in Q, we see that the index of Lq in L 2 is 16 as claimed. ■ 

Remark 2.3: We have now produced a nested sequence of lattices 

2Z 8 = 2L 2 C L 6 C L 5 C L 4 C L 2 = Z 8 (C L 3 ). (1) 
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TABLE I 

Lattices from a coding theoretical point of view 
Z/2 «-» The 8-dimensional rectangular grid Z 8 «-» no coding 

I 

1/4 <-> The checkerboard lattice «-» overall parity check code of length 8 

I 

1/5 «-> The lattice D4 _L D4 «-» two blocks of the overall parity check code of length 4 

I 

1/6 «-» The diamond lattice E% extended Hamming-code of length 8 



We concentrate on the lattices that are sandwiched between 2Z 8 and 1? . It is worthwhile to note that these lattices 
are in a bijective correspondence with a binary linear code of length 8 by projection modulo 2, see Table U above. 
As it happens, within this sequence of lattices the minimum Hamming distance of the binary linear code and the 
minimum determinant of the lattice are somewhat related. 

Thereupon it is natural to ask that what if we simply concatenate the use of L2 with a good binary code (extended 
over several L2-blocks, if needed), and be done with it. While the binary linear codes appearing above are the 
first ones that come to one's mind, we want to caution the unwary end-user. Namely, it is possible that there are 
high weight units in the ring in question. If such binary words are included, then the minimum determinant of the 
corresponding lattice is equal to 1, i.e. no coding gain will take place. E.g. the unit (1 — £ 3 )/(l — £) = 1 + £ + £ 2 = 
(1 + i) +£ of the ring C corresponds to the matrix M(l + i, 1, 0, 0) of determinant 1, and thus we must not allow 
such words of weight 3. If the lattice L\ were used, the situation would be even worse, as then we have units like 
(1 — C 7 )/(l — C) i n the rm g Cl that would be mapped to a word of Hamming weight 7. A construction based on 
ideals provides a mechanism to avoid this problem caused by high weight units. 

III. Cyclic algebras and orders 

In Section [TT] we produced a nested sequence £T|) of quaternionic lattices with the property that as the lattice 
gets denser after rescaling the increased minimum determinant back to one, the BLER perfomance gets better. As 
the sequence (Q]) lies within a specific division algebra, an obvious question evokes how to generalize this idea. 
The theory of cyclic division algebras and their maximal orders offer us an answer. When designing square ST 
matrix lattices for MIMO use, cyclic division algebras are of utmost interest as it has been shown in [15] that 
a non-vanishing determinant is a sufficient condition for full-rate CDA based STBC-designs to achieve the upper 
bound on the optimal DMT, hence proving that the upper bound itself is the optimal DMT for any number of 
transmitters and receivers. Given the number of transmitters n, we pick a suitable cyclic division algebra of index 
n (more on this in a forthcoming paper, see Section IViTl and [33]. See also [15] ). The matrix representation of 
the algebra, with some constraints on the elements, will then correspond to the base lattice, similarly as did the 
lattice L2 in Section [H] Now in order to make the lattice denser, we choose the elements in the matrices from an 
order. The natural first choice for an order is the one corresponding to the ring of algebraic integers of the maximal 
subfield inside the algebra. The densest possible sublattice is the one where the elements come from a maximal 
order. 

All algebras considered here are finite dimensional associative algebras over a field. 
A. Cyclic algebras 

The basic theory of cyclic algebras and their representations as matrices are thoroughly considered in [[34], 
Chapter 8.5] and [6]. We are only going to recapitulate the essential facts here. 

In the following, we consider number field extensions E/F, where F denotes the base field. F* (resp. E*) denotes 
the set of non-zero elements of F (resp. E). Let E/F be. & cyclic field extension of degree n with the Galois group 
Gal(E/F) = (a), where a is the generator of the cyclic group. Let A = (E/F,a,j) be the corresponding cyclic 
algebra of index n, that is, 

A = E@uE@ u 2 E © • • • © u^E, 
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with u G A such that xu = ua(x) for all x G E and u n = 7 G F*. An element a = xq + ux\ + ••• + « 
has the following representation as a matrix A = 

/ x 70-(x n „i) 7CT 2 (x n _ 2 ) ••• 7cr" _1 (xi) \ 
xi a(x ) 7CJ 2 (x n _i) 
x 2 cr(xi) 



n-l 



x n —\ G ,A 



ct 2 (x J 



7°"" (^2) 
7a™- 1 ' 



l (^3) 



\ x n -i a(x n - 2 ) a 2 (x n _ 3 ) 
Let us compute the third column as an example: 



(2) 



a n - l (x Q ) J 



2 2 
u 1— ► a« 



xqu 2 + uxiu 2 + • • • + ii n 1 x„_iu 2 



= u<r(xo)u + n 2 cr(xi)u + • • • + 7a(x n _i)u 
= u 2 a 2 (x ) + u 3 cr 2 (xi) H + U7<7 2 (x n _i), 

and hence as the third column we get the vector 

(7CT 2 (x n _2),7CT 2 (2 n _l),CT 2 (>o), • • • ,CT 2 (X„_ 3 )) T . 



Let us denote the ring of algebraic integers of E by Oe- A basic, rate-ra MIMO STBC C is usually defined as 



C 



' ( Xq 7Cr(x„_i) 
x\ cr(x ) 
x 2 a(xi) 



ja n - 1 (x 1 ) \ 
-ya n ~ 1 (x 2 ) 
7cr n_1 (x 3 ) 



Xi£0 E > ■ 



^ \ X„_i cr(x„_ 2 ) ••• cr n 1 (x ) / 



(3) 



Further optimization might be carried out by using e.g. ideals. If we denote the basis of E over Op by {1, ex, e n _x}, 
then the elements Xj, i = 0, n 



1 in ([3]) take the form Xi = X^fc=o where G Ojr for all fc = 0, 



1. 

Hence n complex symbols are transmitted per channel use, i.e. the design has rate n. In literature this is often 
referred to as having a. full rate. 

Definition 3.1: An algebra A is called simple if it has no nontrivial ideals. An F-algebra A is central if its 
center Z{A) = {a G A\ aa' = a' a W G A} = F. 

Definition 3.2: An ideal T is called nilpotent if Z k = for some k G Z+. An algebra .A is semisimple if it has 
no nontrivial nilpotent ideals. Any finite dimensional semisimple algebra over a field is a finite and unique direct 
sum of simple algebras. 

Definition 3.3: The determinant (resp. trace) of the matrix A is called the reduced norm (resp. reduced trace) 
of an element a G A and is denoted by nr(a) (resp. tr(a)). 

Remark 3.1: The connection with the usual norm map N^p(a) (resp. trace map T^/ F (a)) and the reduced 
norm nr(a) (resp. reduced trace tr(a)) of an element a G A is N A / F (a) = (nr(a)) n (resp. T^/p(a) = ntr(a)), 
where n is the degree of E/F. 

In Section [II] we have attested that the algebra H is a division algebra. The next old result due to A. A. Albert 
[[35], Chapter V.9] provides us with a condition for when an algebra is indeed a division algebra. 

Proposition 3.1: The algebra A = (E/F, a, 7) of index n is a division algebra, if and only if the smallest factor 
t G Z + of n such that 7* is the norm of some element in E* , is n. ■ 



B. Orders 

We are now ready to present some of the basic definitions and results from the theory of maximal orders. The 
general theory of maximal orders can be found in [36]. 

Let S denote a Noetherian integral domain with a quotient field F, and let A be a finite dimensional F-algebra. 
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Definition 3.4: An 5 -order in the F-algebra A is a subring A of A, having the same identity element as A, and 
such that A is a finitely generated module over 5 and generates A as a linear space over F. 

As usual, an 5-order in A is said to be maximal, if it is not properly contained in any other 5-order in A. If the 
integral closure 5 of 5 in A happens to be an 5-order in A, then 5 is automatically the unique maximal 5-order 
in A. 

Let us illustrate the above definition by the following example. 

Example 3.1: (a) Orders always exist: If M is a full 5-lattice in A, i.e. FM = A, then the left order of M 
defined as Oi(M) = {x G A \ xM C M} is an 5-order in A. The right order is defined in an analogous way. 

(b) If ^4. = M. n (F), the algebra of all n x n matrices over F, then A = M. n {S) is an 5-order in A. 

(c) Let a G A be integral over 5, that is, a is a zero of a monic polynomial over 5. Then the ring S[a] is an 
5-order in the F-algebra F[a\. 

(d) Let 5 be a Dedekind domain, and let E be a finite separable extension of F. Denote by 5 the integral closure 
of 5 in E. Then 5 is an 5-order in E. In particular, taking 5 = Z, we see that the ring of algebraic integers of E 
is a Z-order in E. 

Hereafter, F will be an algebraic number field and 5 a Dedekind ring with F as a field of fractions. 

Proposition 3.2: Let ^4 be a finite dimensional semisimple algebra over F and A be a Z-order in A. Let Of 
stand for the ring of algebraic integers of F. Then T = Of A is an O^-order containing A. As a consequence, a 
maximal Z-order in A is a maximal 0F-order as well. ■ 

The following proposition provides us with a useful tool for finding a maximal order within a given algebra. 

Proposition 3.3: Let A be an 5-order in A. For each a G A we have nr(a) G 5 and tr(a) G 5. ■ 

Proposition 3.4: Let V be a subring of A containing 5, such that FT = A, and suppose that each a G T is 
integral over 5. Then T is an 5-order in A. Conversely, every 5-order in A has these properties. ■ 

Corollary 3.5: Every 5-order in A is contained in a maximal 5-order in A. There exists at least one maximal 
5-order in A. ■ 

Remark 3.2: As the previous corollary indicates, a maximal order of an algebra is not necessarily unique. 

Remark 3.3: The algebra H can also be viewed as a cyclic division algebra. As it is a subring of the Hamiltonian 
quaternions, its center consists of the intersection Hfll = Q(\/2). Also Q(£) is an example of a splitting field of 
H. In the notation above we have an obvious isomorphism 

H~(Q(£)/Q(V2),<7,-1), 

where a is the usual complex conjugation. 

Remark 3.4: In principle, the lattices from Section |II] could also be used as MIMO codes, but when we pack H 
in the form of Q, 5c becomes vanishing and the DMT cannot be achieved. 

One extremely well-performing CDA based code taking advantage of a maximal order is the celebrated Golden 
code [8] (also independently found in [9]) treated in the following example. 

Example 3.2: In any cyclic algebra where the element 7 happens to be an algebraic integer, we have the following 
natural order 

K = O e ® uO E e • • • © u n - l o E , 

where Oe is the ring of integers of the field E. We note that Oe is the unique maximal order in E. In the so-called 
Golden Division Algebra (GDA) [8], i.e. the cyclic algebra (E/F,a,j) obtained from the data E = Q(i, V5), 
F = Q(i), 7 = i, n = 2, a(y / 5) = —y/5, the natural order A is already maximal [37]. The ring of algebraic 
integers Oe = Z[i][#], when we denote the golden ratio by = 1+ 2 V ^ . The authors of [8] further optimize the code 
by using an ideal (a) = (1 + i — iO), and the Golden code is then defmd as 



GC 



x ,x 1 eO E >. (4) 



1 / axo ia(a)o~(xi) 
y/5 \ axi a(a)a(x ) 

The Golden code achieves the DMT as the element 7 = i is not in the image of the norm map. For the proof, see 
[8]. 
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Remark 3.5: We feel that in [8], the usage of a maximal order is just a coincidence, as in this case it coincides 
with the natural order which is generally used in ST code designs (cf. (O). At least the authors do not mention 
maximal orders. As far as we know, but our constructions (see also [33]) there does not exist any designs using a 
maximal order other than the natural one. 

Next we prove that the lattice Lq is optimal within the cyclic division algebra H in the sense that the diamond 
lattice Eg = Ti(l + £) corresponds to a proper ideal of a maximal order in H. 

Proposition 3.6: The ring 

ft = {q = ci + £c2 + jc 3 + j£c 4 G H | ci, . . . , c 4 G Q(i), (1 + i)c t G Q Vt, c x + c 3 , c 2 + c 4 G 

is a maximal Z-order of the division algebra H. 

Proof: Clearly the Q-span of TL is the whole algebra H, and we have seen that TL is a ring, so it is an order 
of H. Furthermore, if A is any order of H, then so is A[V2] = A • Z[V2], as the element y/2 is in the center of H 
(cf. Proposition I3.2I ). Therefore it suffices to show that TL is a maximal Z[v2]-order. In what follows, we will call 
rational numbers in the coset (1/2) + Z half-integers. Assume for contradiction that we could extend the order TL 
into a larger order T = TL[q] by adjoining the quaternion q = tii + a 2 ], where the coefficients 

at = m ti0 + + m t ^ + m t ^ , m t) e G Q for all t, £ 

are elements of the field Q(£). As £ — £ 3 = a/2, and = — £ 3 , we see that 

tr(q) = a\+ a\= 2mi j0 + \/2(mi 5 i - mi i3 ). 

By Proposition 13.31 this must be an element of Z[y2], so we may conclude that mi o must be an integer or a 
half-integer, and that my — m\ .3 must be an integer. Similarly 

ir(g£) = -2mi i3 + v^my - my) 

must be an element of Z[\/2]. We may thus conclude that all the coefficients my, £ = 0, 1,2,3 are integers or 
half-integers, and that the pairs m^o, mi,2 (resp. my, rn-i^) must be of the same type, i.e. either both are integers 
or both are half-integers. A similar study of tr(qj) and tr(qj^) shows that the same conclusions also hold for the 
coefficients m2 t t, £ = 0,1,2,3. Because Z[£] C TL, replacing q with any quaternion of the form q — v, where 
v G Z[£] will not change the resulting order T. Thus we may assume that the coefficients my, I = 0, 1, 2, 3 all 
belong to the set {0, 1/2}. Similarly, if needed, replacing q with q — u'j for some v 1 G Z[£] allows us to assume 
that the coefficients m 2 /, I = 0, 1, 2, 3 also all belong to the set {0, 1/2}. Further replacements of q by q — p or 
q — pi then permit us to restrict ourselves to the case mi,t = 0, for all I = 0, 1, 2, 3. If we are to get a proper 
extension of TL, we are left with the cases q = (l + i)/2, q = £(l + i)/2 and q = (1 + £)(1 + i)/2. We immediately 
see that none of these have reduced norms in Z[y2], so we have arrived at a contradiction. ■ 
Remark 3.6: Another related well known maximal order is the icosian ring. It is a maximal order in another 
subalgebra of the Hamiltonian quaternions, namely 

(Q(i,V5)/Q(V5) )( j,-l), 

where a is again the usual complex conjugation. This order made a recent appearance as a building block of a 
MIMO-code in a construction by Liu & Calderbank. We refer the interested reader to their work [38] or [31] for 
a detailed description of this order. 

The icosian ring and our order share one feature that is worth mentioning. As 2 x 2 matrices they do not have the 
non-vanishing determinant property. Algebraically this is a consequence of the fact the respective centers, Q(a/5) or 
Q(\/2) both have arbitrarily small algebraic integers, e.g. the sequence consisting of powers of the units (\/5 — 1) /2 
(resp. v2 — 1) converges to zero. We shall return to this point in the next section, where a remedy is described. 
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IV. Decoding of the nested sequence of lattices 

In this section, let us consider the coherent MIMO case where the receiver perfectly knows the channel coeffi- 
cients. The received signal is 

y = Bx + n, 

where x £ M m , y, n £ M. n denote the channel input, output and noise signals, and B £ ]j nxm j s the Rayleigh 
fading channel response. The components of the noise vector n are i.i.d. complex Gaussian random variables. In 
the special case of a MISO channel (n = 1), the channel matrix takes a form of a vector B = h £ M m (cf. Section 

B- 

The information vectors to be encoded into our code matrices are taken from the pulse amplitude modulation 
(PAM) signal set X of the size Q, i.e., 

X = {u = 2q-Q + l | qeZ Q } 

with Z Q = {0,1,..., Q-l}. 

Under this assumption, the optimal detector g : y h-> x £ X m that minimizes the average error probability 

P{e) = P(x ^ x) 
is the maximum-likelihood (ML) detector given by 

x = arg min xeZ ™ | y - Bx | 2 , (5) 
where the components of the noise n have a common variance equal to one. 



A. Code controlled sphere decoding 

The search in (f5]) for the closest lattice point to a given point y is known to be NP-hard in the general case 
where the lattice does not exhibit any particular structure. In [39], however, Pohst proposed an efficient strategy of 
enumerating all the lattice points within a sphere S(y, \/Co) centered at y with a certain radius y/Co that works 
for lattices of a moderate dimension. For background, see [40]-[43]. For finite PAM signals sphere decoders can 
also be visualized as a bounded search in a tree. 

The complexity of sphere decoders critically depends on the preprocessing stage, the ordering in which the 
components are considered, and the initial choice of the sphere radius. We shall use the standard preprocessing and 



ordering that consists of the Gram-Schmidt orthonormalization B = (Q,Q') [qj °f tne c °l umn s of the channel 

matrix B (equivalently, QR decomposition on B) and the natural back-substitution component ordering given by 
x m , ...,x\. The matrix R is an m x m upper triangular matrix with positive diagonal elements, Q (resp. Q') is an 
n x m (resp. n x (n — m)) unitary matrix, and is an (n — m) x m zero matrix. 
The condition Bx £ 5(y,\/Co) can be written as 

| y - Bx \ 2 < Co (6) 
which after applying the QR decomposition on B takes the form 

| y' - Rx | 2 < C' Q , (7) 
where y' = Q T y and C' Q = Cq — \ (Q') T y\ 2 - Due to the upper triangular form of R, implies the set of conditions 

m m g 

Yjy'j ~^2 r jJ x t - C 'o> i = l,-,m. (8) 

j=i l=j 



The sphere decoding algorithm outputs the point x for which the distance 



2 



d 2 (y,Bx) =Yjy'i -^2 r J,e x £ 



(9) 



is minimum. See details in [43]. 
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TABLE II 

CCSD: Additional case considerations 



CASE L 4 


£f =1 x 4 = (mod 2) 


CASE L 5 


Xl + x 2 = x 5 + x 6 , 

x 3 + X4 = x 7 + x 8 (mod 2) 


CASE L 6 


Xx + x 2 = x 3 + x 4 = x 5 + Xq = x 7 + x 8 , 
E 2 |< ^ = Sap ^ = ( mod 2 ) 



The decoding of the base lattice L2 can be performed by using the algorithm below proposed in [43]. 
Algorithm II, Smart Implementation (Input Cg, y', R. Output x.) 

STEP 1: (Initialization) Set i := m, T m := 0, £ m := 0, and d c := C' (current sphere squared radius). 
STEP 2: (DFE on x^ Set Xi := |_(^ - &)/ri/| and A; := sign^ - & - r^Xj). 
STEP 3: (Main step) If d c < T t + | y\ - & 

— r i,i x i I , then go to STEP 4 (i.e., we are outside the sphere). 
Else if Xi TLq go to STEP 6 (i.e., we are inside the sphere but outside the signal set boundaries). 
Else (i.e., we are inside the sphere and signal set boundaries) if i > 1, then 
{let := J2f=i n-ijXj, T<_i := T;+ | ^ - & - r M x* | 2 , i := * - 1, and go to STEP 2}. 
Else (i=l) go to STEP 5. 

STEP 4: If i = m, terminate, else set i := i + 1 and go to STEP 6. 

STEP 5: (A valid point is found) Let d c := T\+ \ y[ — £1 — ri^xi | 2 , save x := x. 
Then, let i := i + 1 and go to STEP 6. 

STEP 6: (Schnorr-Euchner enumeration of level i) Let Xj := Xi + Aj, Aj := — Aj — szgn(Aj). 
Then, go to STEP 3. 

Note that given the values Xj + i, x m , taking the ZF-DFE (zero-forcing decision-feedback equalization) on Xj 
avoids retesting other nodes at level % in case we fall outside the sphere. Setting d c = 00 would ensure that the first 
point found by the algorithm is the ZF-DFE point (or the Babai point) [43]. However, if the distance between the 
ZF-DFE point and the received signal is very large this choice may cause some inefficiency, especially for high 
dimensional lattices. 

The decoding of the other three lattices in dTJ also relies on this algorithm, but we need to run some additional 
parity checks. This simply means that in addition to the checks concerning the facts that we have to be both inside 
the sphere radius and inside the signal set boundaries, we also have to lie inside a given sublattice. This will be 
taken care of by a method we call code controlled sphere decoding (CCSD), that combines the algorithm above 
with certain case considerations. To this end, let us write the constraints on the elements c, as modulo 2 operations. 
Denote by x = (xi, x 2 , x 8 ) = (Kci, 9ci, 3^4) E M 8 the real vector corresponding to the channel input. 
Note that when exploiting these relations in the CCSD algorithm, we have to use different orderings for the basis 
matrices of the lattice in different cases in order to make the parity checks as simple as possible. Let us first order 
the basis matrices as B x = M(l, 0, 0, 0), B 2 = M(i, 0,0,0), ...,B 7 = M(0,0,0, 1),-B 8 = M(0,0,0,i). Then when 
decoding e.g. the L5 lattice, we reorder the basis matrices as B\, B 2 , B$, Bq, B3, B4, B7, Bg, in order to get the 
sum ci + C3 as the sum of the first 4 components and the sum c 2 + C4 as the sum of the last 4 components (cf. 
Proposition I2.5I ). The conditions for the Gaussian elements of Propositions |2.4||2!6l can clearly be translated into the 
following modulo 2 integer conditions, see for instance Remark 12.21 The additional parity check steps will hence 
be as shown in Table JI] above. 

As the Alamouti scheme [2] has a very efficient decoding algorithm available, and our quaternionic lattices have 
an Alamouti-like block structure, it is natural to ask whether any of the benefits of Alamouti decoding will survive 
for our lattices. We shall see that the block structure allows us to decode the two blocks independently from each 
other. The following simple observation is the underlying geometric reason for our ability to do this. 
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Average complexity at 4 bpcu Average complexity at 8 bpcu 




17 18 19 20 21 22 23 24 25 26 29 30 31 32 33 34 35 36 37 38 39 



SNR (dB) SNR (dB) 



Fig. 1. Average complexity of 4 tx-antenna matrix lattices at rates (approximately) R — 4 and R — 8 bpcu. 



Lemma 4.1: Let A and B be two n x n matrices with the property that the matrices A,B,A H ,B H commute. 
Let h G C 2n be any (row) vector and write 

M(A, B)=( _ A Bli In ) . 

Then the vectors hM(A, 0) and hM(0, B) are orthogonal to each other when we identify C 2n with M and use 
the usual inner product of a vector space over the real numbers. 

Proof: With the identification C 2n = M 4n the real inner product is the real part of the hermitian inner product 
( , ) of C 2n . Write the vector h in the block form h = (h^\ M 2 )), where the blocks ffi\j = 1,2, are (row) 
vectors in C n . Then we can compute 

(hM(A,0),hM(0,B)) 
= {hM(A,0)M(0,B) H ,h) 
= {hM(A,0)M(0,-B),h) 
= (hM(0,-AB),h) 
= (h^A H B H ,hW)-(h^AB,h^). 

As (uM, v) = (vM , u)* for all vectors u, v and matrices M, we see that the above hermitian inner product is 
pure imaginary. ■ 

Corollary 4.2: Let A and B range over sets of n x n-matrices. Let h and r be vectors in C 2n . Then the Euclidean 
distance between r and hM(A, B) is minimized for the A = Aq and B = Bq, when Aq minimizes the Euclidean 
distance between r and h.M(A, 0) and Bq minimizes the Euclidean distance between r and hM(0,B). 

Proof: Write Va (resp. Vb) for the real vector space spanned by the vectors hM(A, 0) (resp. hM(0, B)). 
These subspaces are orthogonal to each other in the sense of Lemma 14.11 Whence we can uniquely write r = 
fA + fB + r±, where ta G Va, tb G Vb and r± is in the (real) orthogonal complement of the direct sum Va®Vb- 
A similar decomposition for the vector hM(A,B) is hM(A,B) = Ua + hs, where Ua = hM(A,0) £ Va and 
h B = hM(0, B) G Vb. By the Pythagorean theorem 

|r - hM(A,B)\ 2 = \r A - hM(^,0)| 2 + \r B - hM(0,B)\ 2 + |r ± | 2 . 

Furthermore, here 

\r A - hM(A0)| 2 = |r- hM(^,0)| 2 - |r B | 2 - |r ± | 2 , 
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so the quantities \ta — hM(^4,0)| 2 and |r — hM(^4,0)| 2 are minimized for the same choice of the matrix A. A 
similar argument applies to the B -components, so the claim follows. ■ 

B. Complexity issues and collapsing lattices 

The number of nodes in the search tree is used as a measure of complexity so that the implementation details 
or the physical environment do not affect it. We have analyzed many different kinds of situations concerning the 
change of complexity of the sphere decoder when moving in £[]) from right to left. 

In Fig. Q] we have plotted the average number of points visited by the algorithm in different cases at the rates 
approximately 4 and 8 bpcu. The SNR regions cover the block error rates between « 10% — 0.01%. As can be 
seen, in the low SNR end, the difference in complexity between the different lattices is clear but evens out when 
the SNR increases. For the sublattices L4, L5, and Lq the algorithm visits 1.1 — 2.1 times as many points as for 
the base lattice L 2 . In the larger SNR end, the performance is fairly similar for all the lattices. E.g. at 4 and 8 
bpcu, when all the lattices reach the bound of maximum 20 points visited, the block error rates of L4, L5, and Lq 
are still as big as 5%, 2%, and 1% respectively. 

Definition 4.1: In a MISO setting we say that a matrix lattice L of rank m collapses at a channel realization 
h, if the receiver's version of the lattice hX spans a real vector space of dimension < m. We call the set of such 
channel realizations the critical set. We say that the sensitivity s(L) (towards collapsing) of the lattice L is r, if 
the critical set is a union of finitely many subspaces of real dimension < r. 

So we e.g. immediately see that a lattice residing in an orthogonal design will have zero sensitivity. While we 
have no precise results the thinking underlying the concept can be motivated as follows. When the infinite lattice 
collapses into a lower dimensional space, its linear structure is severely mutilated. For example the minimum 
Euclidean distance drops to zero — for any e > there will be infinitely many other lattice points within a distance 
< e. Even when we restrict ourselves to a finite subset of the lattice, the coordinates of the nearby points may 
differ drastically. Thus even an ML-decoder will have problems, and an algorithm relying on the orderly linear 
structure of the lattice (like the sphere decoder) cannot work very efficiently. Similar problems are still there, when 
the actual channel realization h is close to a critical vector. 

The sensitivity then enters the scene as a crude measure for the probability of this happening. It is easy to see 
that in a Rayleigh fading channel the probability of the channel vector h to be within e of a critical vector behaves 
like 0(e 2n ~ s ). Thus the lower the sensitivity, the lower the probability of the lattice becoming distorted by the 
channel. 

We lead off by determining the sensitivity of the DAST-lattices. 

Example 4.1: There exist 8-dimensional lattices [5] of 4 x 4 matrices of the form 

I x 1 x 2 x 3 x 4 ^ 
X\ -X2 x 3 -x 4 
x\ x 2 -X3 -x 4 

\ X\ -X 2 ~X 3 X4 J 

These matrices are simultaneously diagonalizable as they have common orthogonal eigenvectors hi = (1,1,1,1), 
I12 = (1, —1, 1, —1), h.3 = (1, 1, —1, —1) and Ii4 = (1, —1, —1, 1)4. Write the channel vector in terms of this basis 



Mdast 



h = ^2j = i<ijhj. If any of the coefficients vanishes, say = 0, then the DAST-lattice collapses, because the 
receiver's version of the lattice will belong to the complex span of the other three eigenvectors hj, j ^ k. On the 
other hand, if all the coefficients aj 7^ 0, j = 1,2,3,4, this channel vector will not be critical. One way of seeing 
this is that applying the linear mapping determined by hj h-> (l/a,j)hj to the receiver's lattice then recovers the 
original full rank lattice of vectors (x%, x 2 , X3, X4). Such a mapping obviously cannot affect the dimension of the 
space spanned by the vectors, so the lattice won't collapse. 
We have shown that the sensitivity of the DAST-lattice is six. 

We proceed to determine the sensitivities of the lattices L\ of Proposition 12.21 and the ones within the nested 
sequence (Q]). Let us first consider L\. Let 

/ hi 
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Fig. 2. The impact of sensitivity on complexity, L\ (~ Ldast) vs Z/2- 



be the 4x4 matrix with rows hi, h 2 ,h 3 ,h 4 of the form (1, ( j , ( 2j , ( 3j ) for j = 1,5,9,13. Recall that earlier 
we have used {1,C>C 2 >C 3 } as an integral basis, so the rows of U are the images of this ordered basis under the 
action of the Galois group G of the extension Q(C)/Q(0- Now it happens that the matrix U is unitary (up to 
a constant factor) as UU* = 4/4. Let z = c\ + C2C + csC 2 + C 4C 3 be an arbitrary algebraic integer of Q(C), 
and M{z) = Ml{c\, C2, C3, C4) S L\ be the corresponding matrix of Proposition 12.21 According to the theory of 
algebraic numbers (and also trivially verified by hand) the rows of U are (left) eigenvectors of M(z), and 

/ z \ 

a 2 (z) 

03(2) 

\ o- 4 (z) / 

is a diagonal matrix with entries gotten by applying the elements of the Galois group G = {a\ = id, 0-2,0-3, 0-4} to 
the number z. 

So all the matrices Ml(c\, C2, 03,04) are diagonalized by U. Therefore we might call the lattice L\ 'DAST-like', 
as it shares this property with the lattices from [5]. 

Proposition 4.3: The lattice L% has sensitivity six. 



UM{z)U- 1 = 
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Proof: The situation is completely analogous to that of Example 14. 11 The lattice L\ will collapse, iff the channel 
realization belongs to any of the 4 complex vector spaces spanned by any three of the common eigenvectors. ■ 
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Fig. 3. The scaled impact of sensitivity on complexity, L\ (~ Ldast) vs L2. 



In order to study the quaternionic lattices we first observe that the 2 x 2-matrices A and B appearing as blocks 
of a matrix M £ L2 all have (1, ±£) as their common (left) eigenvectors. The same holds for the adjoints A*, B* 
as they also appear as blocks of M* that also happens to belong to the lattice L2. From the proof of Proposition 
2.41 we see that the matrix MM*, M = M(ci, 02,03,04), has eigenvalues a±|fc| with respective (left) eigenvectors 
(1, ±£, 0, 0) and (0, 0, 1, ±£). Here a = Y,j=i \ c j\ 2 and & = - * c l^ + c 2^ - ic 3 C4 + C4C3. We make this more 
precise before we determine the sensitivity of the quaternionic lattices. 

There is a connection between our MISO-code and the multi-block codes introduced by Belfiore in [45] and Lu 
in [44] that can be best explained with the notation of the present section. Consider the unitary matrix with the 
above basis vectors as columns 

/ 1 1 \ 
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If we conjugate the matrices of the algebra H by U we get matrices of the form 



/ 












\ 




X2 
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• r i 


















r(xi) 


-r(x 2 ) 




V 








r(x 2 ) 




/ 



where the elements xi,x 2 belong to the field Q(£) = Q(i, v2), and r : Q(£) — ► Q(£) is the automorphism 
determined by r(i) = i, r(v2) = — v2- Thus we see that our MISO-code is unitarily equivalent to a multi-block 
code with a structure similar to [44] — only our center is smaller. 

The upshot here, as well as in [45], [44], and in the icosian construction from [38] is that while the individual 
diagonal blocks may have arbitrarily small determinants, when we use them together with their algebraic conjugates, 
the diagonal blocks together conspire to give a non-vanishing determinant. This is because the algebraic conjugates 
of small numbers are necessarily just large enough to compensate as the algebraic norms are known to be integers. 

Another benefit enjoyed by our matrix representation of the algebra H over the above multi-block representation 
is that the signal constellation is better behaved. Surely the simple QAM-constellation of our matrices is to be 
preferred over the linear combinations of two rotated QAM-symbols of the multi-block representation. 

This feature clearly begs to be generalized to a MIMO-setting. One such construction is the previously mentioned 
icosian construction of Liu & Calderbank [38], where they managed to add a multiplexing gain of 2 to a similar 
multi-block representation of the icosians. It turned out that the question of how to best do this in the spirit of 
the present article is somewhat delicate. The resulting codes will necessarily be asymmetric MIMO-codes, and we 
refer the reader to [46]. 

We return to the sensitivity of the quaternionic lattices. The following result is now easy to verify 
Proposition 4.4: Let V+ (resp. V-) be the complex subspace of C 4 generated by the vectors (1,£, 0,0) and 

(0, 0, 1, £) (resp. by (1, — £, 0, 0) and (0, 0, 1, — £)). The subspaces V + and V_ are orthogonal complements of each 

other in C 4 , so any channel vector can be uniquely written as 

h = h+ + h_, 

where h± G V± respectively. If h belongs to one of the subspaces V+,V-, the lattice hL 2 collapses. Otherwise 
the lattice L 2 does not collapse. In particular the sensitivity of the lattices L 2 , L3, L4, L5, Lq is four. ■ 

Our simulations, indeed, show that the complexity of a sphere decoder increases sharply, when we approach the 
critical set. A comparison between the lattices L\ and L 2 does not show a dramatic difference between the average 
complexities of a sphere decoder, but the difference becomes very apparent, when studying the high-complexity 
tails of the complexity distribution. 

In Fig. [2] we have plotted the complexity distribution of 5000 transmissions for different data rates. On the 
horizontal axis the quantity min( |hj| 2 ) (resp. min( |h + | 2 ,|h_| 2 )) describes how close the lattice L\ (resp. 
L 2 ) is to the situation where it would collapse. That is, how close to zero the minimum of the components 
hi G Vi, i = 1,2,3,4, (resp. h± G V±) gets (cf. Remark |4~3l and Proposition 14.41 ). For both L\ and L 2 the figure 
shows that the smaller the quantity, the higher the complexity. We can also conclude that the lattice L\ nearly 
collapses a lot more often than the lattice L 2 . In addition, the number of points visited by the sphere decoding 
algorithm is much higher for L\ than for L 2 . These are phenomena caused by the higher sensitivity of L\. In Fig. 
[3] the scaled impact of sensitivity is depicted. 

Note that as Ldast has the same sensitivity as L\, we can equally well analyze the behavior of the DAST 
lattice on the basis of Fig. [2] and Fig. [3] 

V. Energy considerations and simulations 
As a summary of Propositions 12.2142.61 we get the following. 

Proposition 5.1: (1) The lattice L 2 is isometric to the rectangular lattice Z 8 and has a minimum determinant 
equal to 1. 

(2) The lattice L4 isometric to D$ is an index two sublattice of L 2 and has a minimum determinant equal to 4. 

(3) The lattice L5 isometric to is an index four sublattice of L 2 and has a minimum determinant equal 
to 16. 
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(4) The lattice L$ isometric to Eg is an index 16 sublattice of L2 and has a minimum determinant equal to 64.B 

In order to compare these lattices we scale them to the same minimum determinant. When a real scaling factor 
p is used the minimum determinant is multiplied by p 2 . As all the lattices have rank 8, the fundamental volume 
is then multiplied by p 8 . Let us choose the units so that the fundamental volume of L2 is miL^) = 1- Then after 
scaling m{L±) = 1/2, m(L^) = 1/4, and m(L§) = 1/4. As the density of a lattice is inversely proportional to 
the fundamental volume, we thus expect the codes constructed within e.g. the lattices L4 and Lq to outperform the 
codes of the same size within L2. 



Average energy of some 4 Tx lattices Block error rates at 2 bpcu 

300 I ! 1 1 1 1 1 ! ! i - i 10° r 1 7T7. ! 1 r 




Rate (bpcu) SNR (dB) 

Fig. 4. Average energy (left) and block error rates of 4 Tx-antenna lattices at 2 bpcu with one receiver (right). 



The exact average transmission power data in Fig. @] is computed as follows. Given the size K of the code we 
choose a random set of K shortest vectors from each lattice. The average energy of the code 



K 

is then computed with the aid of theta functions [31]. All the lattices were normalized to have minimum determinant 
equal to 1. When using the matrices M(c\, 02,03,04) of Proposition 12.11 in some cases we are better off selecting 
the input vectors (ci, 02,03, 04) from the coset |(1 + i, 1 + i, 1 + i, 1 + i) + Q A instead of letting them range over 
Q 4 . Obviously such a translation does not change the minimum determinant of the code, but it sometimes results 
in significant energy savings. E.g. to get a code of size 256 it is clearly desirable to let the coefficients c\, ca, 03, 04 
range over the QPSK-alphabet. 

Fig. [5] shows the block error rates of the various competing lattice codes at the rates approximately 2, 4, 6, and 8 
bpcu, i.e. all the codes contain roughly 2 8 , 2 16 , 2 24 or 2 32 matrices respectively. For the lattices L\, L2, Ldast, and 
Labba [20] this simply amounted to letting the coefficients 01,02,03,04 take all the values in a QPSK-alphabet. 
Therefore, it would have been easy to obtain bit error rates as well. For the lattices L4, L5, Lq the rate is not 
exact, see ([TOl below and the preceding explanation. Of course also the exact rate equal to a power of two could 
be achieved by just choosing a more or less random set of shortest lattice vectors. As there is no natural way to 
assign bit patterns to vectors of Dg, D4-LD4 or Eg, we chose to show the block error rates instead of the bit error 
rates. 

The simulations were set up, here, so that the 95 per cent reliability range amounts to a relative error of about 3 
per cent at the low SNR end and to about 10 per cent at the high SNR end (or to about 4000 and 400 error events 
respectively). One receiver was used for all the lattices. 

When moving left in £T|) the minimum determinant increaces while the BLER decreases at the same time. 
However, the other side of the coin is that improvements in the BLER performance cause a slightly more complex 
decoding process by increasing the number of points visited in the search tree. Still after this increasement, even 
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Block error rates at 2 bpcu 



Block error rates at 4 bpcu 




25 26 27 

SNR (dB) 



32 33 

SNR (dB) 



Fig. 5. Block error rates of 4 tx-antenna lattices at approximately 2.0, 4.0, 6.0, and 8.0 bpcu with one receiver. 



the lattice Lq admits a fairly low average complexity as compared to the lattices L\ and Least due to its lower 
sensitivity. In part of the pictures in Fig. [5j the order of the curves seems not to respect the above mentioned order, 
but this only happens because the rates are not exactly the same for all the lattices. E.g. at the rate 4 bpcu, 
the exact rates for L2,L^,Lq, and Lq are 4,3.75,4.14, and 4.17 bpcu respectively. Consequently, the lattice L\ 
seems to perform better than what it actually does. Let us shortly explain how these rates follow: when picking 
the elements x\,...,x% from the set Zq (cf. Section [TV] (0) and the discussion after Algorithm II), the size of the 

O 8 lo Q 8 

code within the lattice L i: i = 2,4,5,6, will be rg^n = 2° E i t ! :t .i, where [L2 : Li] is the index of the sublattice 
Li inside L2 (cf. Proposition 15 -lb - Hence, the data rate in bits per channel use can be computed as 

R J° S T&\, (10) 
4 

Now, for instance, to get as close to the rate R = 4 bpcu as possible, we have to choose Q = 4, Q = 4, Q = 5, 
and Q = 6 for the lattices L2,L^,L^, and Lq respectively. By substituting Q and the sublattice index in question 
to (TTOb we obtain the above rates. 

Simulations at the rate 6 bpcu with one receiver show that the lattice L 6 wins by approximately 1 dB over the 
lattice L2 and by at least 2.5 dB over Ldast- At the rate 2 bpcu, the rotated ABBA lattice Labba is already 
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beaten by the L2 lattice by a fraction of a dB. The difference between L2 and Ldast is even clearer: L2 gains 
1 — 2 dB over Ldast, depending on the SNR. At all data rates the lattice Lq outperforms all the other lattices. 

Prompted by the question of one of the reviewers, we make the following remark in case that the reader is 
familiar with the Icosian code [38] and ponders over whether and how it relates to the codes presented in this 
paper. 

Remark 5.1: The Icosian lattice Licosian presented in [38] takes use of the Icosian ring (cf. Remark [331 ) 
and has a similar looking structure to the Golden code [11], where the matrix elements are replaced with Icosian 
Alamouti blocks 



A code within this lattice is called Icosian code. Note that Jafarkhani's quasi-orthogonal code [30] in the simulations 
of [38] is exactly our base lattice L2. 

First of all, note that the Icosian code has code rate two, as the lattice is 16-dimensional over the reals. Hence, 
in order to enable efficient linear decoding, at least two antennas are required at the receiving end. Taking this into 
consideration, there is no good way to make fair comparison between the Icosian lattice and the 8-dimensional 
lattices proposed in this paper. If the application at hand allows us to use one receiving antenna only, we either 
have to puncture Licosian (e-g- by setting B = 0) which will cause it to lose its benefits, or, we need to perform 
complex decoding process (e.g. a sphere decoder cannot be used). 

However, if we still want to compare these codes with two receivers, our codes will of course lose due to the 
lower code rate as they are designed for MISO use only. Similar comparison could be done e.g. with the 4x4 
Perfect code [11] and the Icosian code resulting to the loss of the Icosian code due to its lower rate (two vs. four). 
When using one receiver for the Icosian code by punctring the block B, it will lose to L2 by 0.5-1 dB at 2 bpcu 
depending on the SNR as depicted in Figure [4] But, as noted above, in this way Licosian will of course lose its 
benefits (as we are not really using the whole Icosian ring) so this is not a comparison on which we should put 
too much value. 

To conclude, the codes in this paper and the Icosian code are targeted into different types of applications: the 
first ones are aimed for systems with one receiving antenna, whereas the Icosian code naturally fits into systems 
with two receiving antennas. 

VI. Diversity-multiplexing tradeoff analysis 

This section contains the DMT analysis of the MISO codes constructed in this paper. We denote by n t (resp. 
n r ) the number of transmitting (resp. receiving) antennas. For the rest of the notation, see [21]. 
Let us first consider the number field construction. Denote (cf. Proposition 12.21 ) 



k \ C4 c 3 c 2 ci / J 

where A C Z[i] is some constellation set. This code is for the MISO system with nt = 4 transmit and n r = 1 
receive antennas. Given the transmit code matrix X <E Li, the received signal vector is 







y T = 9h T X + n_ 



T 



where h~ CAA(0,/ 4 ). 
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Let r be the desired multiplexing gain; then we need 

\L X \ = SNR 4r = \A\ A 



and the above in turn gives 
Hence we see for every Ci e A 
and 



= SNFT. 
\\a\\ 2 < SNR r 
e 2 = SNR 1 r . 



(11) 

(12) 
(13) 



Let A := \\h\\ 2 F = SNR a and let S± > ■ ■ ■ > 64 be the ordered eigenvalues of XX^; then the random Euclidean 
distance is lower bounded by 

>SMR El > 



d 2 E > e 2 \5 4 



nti Si 



where 



E Fl = 1 — r — a — 3r = l — 4r — a. 
Now the DMT of this code is given by 

di Jl (r) > inf 4a = 4(1 - 4r), for < r < 

El 1 <o 4 

while the optimal tradeoff in this channel is actually 

d*(r) = 4(1 -r) for < r < 1. 

The quaternionic construction is 



, C% £ A 



First of all, as pointed out in the proof of Proposition 2.4, the matrix M £ L2 is of the following form: 

A -B H 



(14) 
(15) 

(16) 
(17) 





( C1 


ic 2 


c 3 


-4 \ 




C2 


Cl 


ic| 


-4 
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C3 




r* 






V c 4 


C3 


-zc 2 


<A J 



M = 



B A 



H 



and 



MM" = (AA" + B«B \ 

\ a'm + .rb^ y 

AA H + 

AA H + 55^ 





since AB = BA. Thus the ordered eigenvalues of MM H satisfy 5\ = 62 > <$3 = (54 and in particular, <5i > £3 
are the ordered eigenvalues of AA H + BB H . Secondly, note that MM H satisfies the non-vanishing determinant 
property, and so does the matrix ^4^4^ + BB H . Now the bound for the random Euclidean distance is 

4 > 2 X5 4 = ^ >SNR Bi % (18) 

where 

E L2 = l-r-a-r = l-2r-a. (19) 
Now the DMT of this code is given by 



d L2 ( r ) > inf 4 « = 4(1 -2r), forO<r<-. 

E Lo <0 2 



(20) 
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The same of course also holds for codes within the sublattices L^L§,Lq C L2. 

Remark 6.1: While our codes are not DMT optimal, it has to be noticed that without using a full-rate code the 
DMT cannot be achieved. Hence, if one wishes to enable efficient decoding process with one receiving antenna 
only (see the remark below), sacrifices in terms of the DMT have to be made. However, our quaternionic lattices 
L2, -L4, -L5, -^6 admit higher DMT as e.g. the DAST lattice, as the DMT of the DAST lattice coincides with that of 
Li. 

Remark 6.2: One might ponder why not use e.g. the full-rate CDA based codes (cf. [6], [11]) as they are DMT 
optimal provided that they have non-vanishing determinant. The answer to this is in principle the same as the one 
provided in Remark 15711 We could naturally do this, but considering that we only want to use one receiving antenna 
it should be clear that a full-rate code cannot be efficiently used. Indeed, using a full-rate code would destroy the 
lattice structure and cause exponential complexity at the receiver. To enable efficient decoding with one receiver 
we have to limit ourselves to rate-one codes, which exactly we have done in this paper. We want the reader to 
note that full-rate codes (e.g. the perfect codes [11]) are optimally suited for systems with i%t = n r > 1, hence 
inapplicable to the purposes of this paper where we have n t = 4 and n r = 1. 

VII. Conclusions and suggestions for further research 

In this paper, we have presented new constructions of rate-one, full-diversity, and energy efficient 4x4 space- 
time codes with non-vanishing determinant by using the theory of rings of algebraic integers and their counterparts 
within the division rings of Lipschitz' and Hurwitz' integral quaternions. A comfortable, purely number theoretic 
way to improve space-time lattice constellations was introduced. The use of ideals provided us with denser lattices 
and an easy way to present the exact proofs for the minimum determinants. The constructions can be extended 
also to a larger number of transmit antennas, and they nicely fit with the popular Q 2 -QAM and QPSK modulation 
alphabets. The idea of finding denser sublattices within a given division algebra was also generalized to a MIMO 
case with arbitrary number of Tx antennas by using the theory of cyclic division algebras and, as a novel method, 
their maximal orders. This is encouraging as the CDA based square ST constructions with NVD are known to 
achieve the DMT. We have also shown that the explicit constructions in this paper all have a simple decoding 
method based on sphere decoding. Related to the decoding complexity, the notion of sensitivity was introduced for 
the first time in this paper. The experimental results have given evidence about the relevance of this new notion. 

Comparisons with the four antenna DAST block code have shown that our codes provide lower energy and block 
error rates due to their good minimum determinant, i.e. high density and lower sensitivity. At the moment, we are 
searching for well-performing MIMO codes arising from the theory of crossed product algebras and maximal orders 
of cyclic division algebras. We have noticed that also the discriminant of a maximal order plays an important role 
in code design. It is desirable to choose cyclic division algebras for which the discriminant of a maximal order 
is as small as possible [33]. By now, we are able to construct an explicit cyclic division algebra of an arbitrary 
index over Q(i) (or Q(uj)) that has a maximal order with minimal discriminant. Despite the fact that we have 
not yet fully analyzed the practical performance of codes arising from these constructions, the preliminary results 
have been very promising. Further details on this and on the algorithmic properties of maximal orders (see also 
[47]-[49]) will be given in a forthcoming paper [33]. 
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